Web-page recognition algorithm estimating text coherence


Similar resources

Estimating the Rate of Web Page Updates

Estimating the rate of Web page updates helps improve a Web crawler's scheduling policy. However, most Web sources are autonomous and updated independently. Clients such as Web crawlers are not aware of when or how often the sources change. Unlike other studies, we model the process of Web page updates as a non-homogeneous Poisson process and focus on determining the localized rate of updates. Th...

A Score based Web Page Ranking Algorithm

With the explosive growth of information on the Web, users have difficulty finding the information they want. A search engine helps by retrieving useful information from this huge collection based on the user's search query and presenting a list of relevant web pages as the search result. However, without proper ranking of pages in the result through the relevancy of pages to the search...

A Marker Propagation Algorithm for Text Coherence

Text coherence is a difficult problem in natural language processing. A text is considered to be coherent when sentences follow logically one after the other. In this paper we describe a computational method that provides an explanation why a text is coherent. By providing such an explanation one can infer a number of assertions unstated in a text. Our computational method is based on a parallel mark...

A Novel Approach to Feature Selection Using PageRank algorithm for Web Page Classification

In this paper, a novel filter-based approach is proposed using the PageRank algorithm to select the optimal subset of features as well as to compute their weights for web page classification. To evaluate the proposed approach multiple experiments are performed using accuracy score as the main criterion on four different datasets, namely WebKB, Reuters-R8, Reuters-R52, and 20NewsGroups. By analy...

Web Page Ranking Based on Text Substance of Linked Pages

The World Wide Web is a large repository of interlinked hypertext documents accessed via the Internet. The Web may contain text, images, video, and other multimedia data. Users navigate it using hyperlinks. A search engine returns millions of results and applies Web mining techniques to order them. The sorted order of search results is obtained by applying some special algorithms ca...

Journal

Journal title: On-line Journal "Naukovedenie"

Year: 2015

ISSN: 2223-5167

DOI: 10.15862/71tvn115